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Abstract 



The present invention is directed to a process for the preparation of a consensus protein specifically a 
phytase consensus protein, the consensus protein obtainable or obtained by such process and specific 
consensus protein mutants. 
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Table 5 (continued) 



phytase 


temperature optimum 


7m a 


A. nidulans 
M. thermophila 


45°C 
55 *C 


55.7 °C 



Claims 

io 1 . A process for the preparation of a consensus protein, whereby such process is characterized by the following steps: 

a) at least three, preferably four amino acid sequences are aligned by any standard alignment program known 
in the art; 

is b) amino acids at the same position according to such alignment are compared regarding their evolutionary 

similarity by any standard program known in the art, whereas the degree of similarity provided by such a 
program which defines the least similarity of the amino acids that is used for the determination of an amino 
acid of corresponding positions is set to a less stringent number and the parameters are set in such a way 
that it is possible for the program to determine from only 2 identical amino acids at a corresponding position 

20 an amino acid for the consensus protein; however, if among the compared amino acid sequences are se- 

quences that show a much higher degree of similarity to each other than to the residual sequences, these 
sequences are represented by their consensus sequence determined as defined in the same way as in the 
present process for the consensus sequence of the consensus protein or a vote weight of 1 divided by the 
number of such sequences is assigned to every of those sequences. 

25 

c) in case no common amino acid at a defined position is identified by the program, any of the amino acids, 
preferably the most frequent amino acid of all such sequences is selected; 

d) once the consensus sequence has been defined, such sequence is back-translated into a DNA sequence, 
so preferably by using a codon frequency table of the organism in which expression should take place; 

e) the DNA sequence is synthesized by methods known in the art and used either integrated into a suitable 
expression vector or by itself to transform an appropriate host cell; 

35 f) the transformed host cell is grown under suitable culture conditions and the consensus protein is isolated 

from the host cell or its culture medium by methods known in the art. 

2. A process as claimed in claim 1 wherein the program used for the comparison of amino acids at a defined position 
regarding their evolutionary similarity is the program "PRETTY". 

40 

3. A process as claimed in claim 1 or 2, wherein the defined protein family is the family of phytases. 

4. A process as claimed in claim 3, wherein the phytases are of fungal origin. 

45 5. A process as claimed in any one of claims 1 to 4, wherein the host cell is of eukaryotic origin. 

6. A process as claimed in claim 5, wherein eukaryotic means fungal, preferably Aspergillus or yeast, preferably 
Saccharomyces or Hansenula. 

so 7. A consensus protein obtainable, preferably obtained by a process as claimed in any one of claims 1 to 6. 

8. A consensus protein which has the amino acid sequence shown in Figure 2 or any variants or muteins thereof. 

9. A mutein of the consensus protein of claim 8 characterized therein that in the amino acid sequence of Figure 2 
ss the following replacements have been effected Q50L, Q50T, Q50G, Q50T-Y51 N or Q50L-Q51 N. 

1 0. A food, feed or pharmaceutical composition comprising a consensus protein as claimed in any of the claims 7 to 9. 
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Description 

[0001] Phytases (myo- inositol hexakisphosphate phosphohydrolases; EC 3.1.3.8) are enzymes that hydrolyze 
phytate (myo-inositol hexakisphosphate) to myoinositol and inorganic phosphate and are known to be valuable feed 
5 additives. 

[0002] A phytase was first described in rice bran in 1 907 [Suzuki et al., Bull. Coll. Agr. Tokio Imp. Univ. 7, 495 (1 907)] 
and phytases from Aspergillus species in 1911 [Dox and Golden, J. Biol. Chem.10, 183-186 (1911)]. Phytases have 
also been found in wheat bran, plant seeds, animal intestines and in microorganisms [Howsen and Davis, Enzyme 
Microb. Techno). 5, 377-382 (1983), Lambrechtsetal., Biotech. Lett. 14, 61-66 (1992), Shieh and Ware, Appl. Microbiol. 
10 16, 1348-1351 (1968)]. 

[0003] The cloning and expression of the phytase from Aspergillus niger (flcuum) has been described by Van Hart- 
ingsveldt et al., in Gene, 127, 87-94 (1993) and in European Patent Application, Publication No. (EP) 420 358 and 
from Aspergillus niger var. awamort by Piddington et al., in Gene 133, 55-62 (1993). 

[0004] Cloning, expression and purification of phytases with improved properties have been disclosed in EP 684 
is 313. However, since there is a still ongoing need for further improved phytases, especially with respect to their ther- 
mostability, it is an object of the present invention to provide the following process which is, however, not only applicable 
to phytases. 

[0005] A process for the preparation of a consensus protein, whereby such process is characterized by the following 
steps: 

20 

a) at least three preferably four amino acid sequences of a defined protein family are aligned by any standard 
alignment program known in the art; 

b) amino acids at the same position according to such alignment are compared regarding their evolutionary sim- 
25 ilarity by any standard program known in the art, whereas the degree of similarity provided by such a program 

which defines the least similarity of the amino acids that is used for the determination of an amino acid of corre- 
sponding positions is set to a less stringent number and the parameters are set in such a way that it is possible 
for the program to determine from only 2 identical amino acids at a corresponding position an amino acid for the 
consensus protein; however, if among the compared amino acid sequences are sequences that show a much 
30 higher degree of similarity to each other than to the residual sequences, these sequences are represented by thei r 

consensus sequence determined as defined in the same way as in the present process for the consensus sequence 
of the consensus protein or a vote weight of 1 divided by the number of such sequences is assigned to every of 
those sequences. 

35 c) in case no common amino acid at a defined position can be identified by the program, any of the amino acids 

of all sequences used for the comparison, preferably the most frequent amino acid of all such sequences is selected 
or an amino acid is selected on the basis of the consideration given in Example 2. 

d) once the consensus sequence has been defined, such sequence is back-translated into a DNA sequence, 
40 preferably using a codon frequency table of the organism in which expression should take place; 

e) the DNA sequence is synthesized by methods known in the art and used either integrated into a suitable ex- 
pression vector or by itself to transform an appropriate host cell; 

45 f) the transformed host cell is grown under suitable culture conditions and the consensus protein is isolated from 

the hosl cell or its culture medium by methods known in the art. 

[0006] In a preferred embodiment of this process step b) can also be defined as follows: 

so b) amino acids at the same position according to such an alignment are compared regarding their evolutionary 

similarity by any standard program known inthe art, whereas the degree of similarity provided by such program is 
set at the lowest possible value and the amino acid which is the most similar for at least half of the sequences 
used for the comparison is selected for the corresponding position in the amino acid sequence of the consensus 
protein. 

55 

[0007] A preferred embodiment of this whole process can be seen in a process in which a sequence is choosen from 
a number of highly homologous sequences and only those amino acid residues are replaced which clearly differ from 
a consensus sequence of this protein family calculated under moderately stringent conditions, while at all positions of 
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the alignment where the method is not able to determine an amino acid under moderately stringent conditions the 
amino acids of the preferred sequence are taken. 

[0008] It is furthermore an object of the present invention to provide such a process, wherein the program used for 
the comparison of amino acids at a defined position regarding their evolutionary similarity is the program "PRETTY". 
5 It is more specifically an object of the present invention to provide such a process, wherein the defined protein family 
is the family of phytases, especially wherein the phytases are of fungal origin. 

[0009] It is furthermore an object of the present invention to provide such processes, wherein the host cell is of 
eukaryotic, especially fungal, preferably Aspergillus or yeast, preferably Saccharomyces or Hansenula origin. 
[0010] It is also an object of the present invention to provide a consensus protein obtainable preferably obtained, by 

10 such processes and specifically the consensus protein, which has the amino acid sequence shown in Figure 2 or a 
variant thereof. A "variant" refers in the context of the present invention to a consensus protein with amino acid sequence 
shown in Figure 2 wherin at one or more positions amino acids have been deleted, added or replaced by one or more 
other amino acids with the provisor that the resulting sequence provides for a protein whose basic properties like 
enzymatic activity (type of and specific activity), thermostability, activity in a certain pH-range (pH-stability) have not 

*5 significantly been changed. "Signaficantly means in this context that a man skilled in the art would say that the properties 
of the variant may still be different but would not be unobvious over the ones of consensus protein with the amino acid 
sequence of Figure 2 itself. 

A mutein refers in the context of the present invention to replacements of the amino acid in the amino acid sequences 
of the consensus proteins shown in 
20 Figure 2 which lead to consensus proteins with further improved properties e. g. activity. Such muteins can be defined 
and prepared on the basis of the teachings given in European Patent Application number 97810175.6, e. g. Q50L, 
Q50T, Q50G, Q50L-Y51 N, or Q50T-Y51 N. "Q50L" means in this context that at position 50 of the amino acid sequence 
the amino acid Q has been replaced by amino acid L. 

[0011] In addition, a food, feed or pharmaceutical composition comprising a consensus protein as defined above is 

25 also an object of the present invention. 

[0012] In this context "at least three preferably three amino acid sequences of such defined protein family" means 
that three, four, five, six to 1 2, 20, 50 or even more sequences can be used for the alignment and the comparison to 
create the amino acid sequence of the consensus protein. "Sequences of a defined protein family" means that such 
sequences fold into a three dimensional structure, wherein the a-helixes, the (J-sheets and-turns are at the same 

30 position so that such structures are, as called by the man skilled in the art, superjmposable. Furthermore these se- 
quences characterize proteins which show the same type of biological activity, e.g. a defined enzyme class, e.g. the 
phytases. As known in the art, the three dimensional structure of one of such sequences is sufficient to allow the 
modelling of the structure of the other sequences of such a family. An example, how this can be effected, is given in 
the Reference Example of the present case. "Evolutionary similarity" in the context of the present invention refers to 

35 a schema which classifies amino acids regarding their structural similarity which allows that one amino acid can be 
replaced by another amino acid with a minimal influence on the overall structure, as this is done e.g. by programs, like 
"PRETTY", known in the art. The phrase "the degree of similarity provided by such a program. ..is set to less stringent 
number" means in the context of the present invention that values for the parameters which determine the degree of 
similarity in the prgram used in the practice of the present invention are chosen in a way to allow the program to define 

^0 a common amino acid for a maximum of positions of the whole amino acid sequence, e. g. in case of the program 
PRETTY a value of 2 or 3 for the THRESHOLD and a value of 2 for the PLURALITY can be choosen. Furthermore, 
"a vote weight of one devided by the number of such sequences" means in the context of the present invention that 
the sequences which define a group of sequences with a higher degree of similarity as the other sequences used for 
the determination of the consensus sequence only contribute to such determination with a factor which is equal to one 

45 devided by a number of all sequences of this group. 

As mentioned before should the program not allow to select the most similar amino acid, the most frequent amino acid 
is selected, should the latter be impossible the man skilled in the art will select an amino acid from all the sequences 
used for the comparison which is known in the art for its property to improve the thermostability in proteins as discussed 
e.g. by 

so 

Janecek, S. (1993), Process Biochem. 28, 435-445 or 
Fersht, A. R. & Serrano, L. (1993), Curr. Opin. Struct. Biol 3, 75-83. 
Alber, T (1989), Annu. Rev. Biochem. 58, 765-798 or 
Matthews, B. W. (1987), Biochemistry 26, 6885-6888. 
55 Matthews, B. W. (1991), Curr. Opin. Struct. Biol. 1, 17-21. 

[0013] The stability of an enzyme is a critical factor for many industrial applications. Therefore, a lot of attempts, 
more or less successful, have been made to improve the stability, preferably the thermostability of enzymes by rational 
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(van den Burg et al, 1 998) or irrational approaches (Akanuma et ai, 1 998). Tha forces influencing the thermostability 
of a protein are the same as those that are responsible for the proper folding of a peptide strand (hydrophobic inter- 
actions, van der Waals interactions, H-bonds, salt bridges, conformational strain (Matthews, 1993). Furthermore, as 
shown by Matthews et ai (1 987), the free energy of the unfolded state has also an influence on the stability of a protein. 

5 Enhancing of protein stability means to increase the number and strength of favorable interactions and to decrease 
the number and strength of unfavorable interactions. It has been possible to introduce disulfide linkages ( Sauer et at., 
1 986) to replace glycine with alanine residues or to increase the proline content in order to reduce the free energy of 
the unfolded state (Margarit etai, 1 992; Matthews, 1 987a). Other groups concentrated on the importance of additional 
H-bonds or salt bridges for the stability of a protein (Blaber et ai, 1 993) or tried to fill cavities in the protein interior to 

10 increase the buried hydrophobic surface area and the van der Waals interactions (Karpusas et a/., 1 989). Furthermore, 
the stabilization of secondary structure elements, especially a-helices, for example, by improved helix capping, was 
also investigated (Munoz & Serrano, 1995). 

[0014] However, there is no fast and promising strategy to identify amino acid replacements which will increase the 
stability, preferably the thermal stability of a protein. Commonly, the 3D structure of a protein is required to find locations 

J5 in the molecule where an amino acid replacement possibly will stabilize the protein's folded state. Alternative ways to 
circumvent this problem are either to search for a homologous protein in a thermo- or hyperthermophile organism or 
to detect stability-increasing amino acid replacements by a random mutagenesis approach. This latter possibility suc- 
ceeds in only 1 0 3 to 1 0 4 mutations and is restricted to enzymes for which a fast screening procedure is available (Arase 
et al, 1 993; Risse et at, 1 992). For all these approaches, success was variable and unpredictable and, if successful, 

20 the thermostability enhancements nearly always were rather small. 

[0015] Here we present an alternative way to improve the thermostability of a protein. Imanaka et ai (1986) were 
among the first to use the comparisons of homologous proteins to enhance the stability of a protein. They used a 
comparison of proteases from thermophilic with homologous ones of mesophilic organisms to enhance the stability of 
a mesophilic protease. Serrano et al. (1993) used the comparison of the amino acid sequences of two homologous 

2S mesophilic RNases to construct a more thermostable Rnase. They mutated individually all of the residues that differ 
between the two and combined the mutations that increase the stability in a multiple mutant. Pantoliano etai. (1989) 
and, in particular, Steipe et ai (1994) suggested that the most frequent amino acid at every position of an alignment 
of homologous proteins contribute to the largest amount to the stability of a protein. Steipe etai (1994) proved this for 
a variable domain of an immunoglobulin, whereas Pantoliano etai. (1 989) looked for positions in the primary sequence 

30 of subtilisin in which the sequence of the enzyme chosen to be improved for higher stability was singularly divergent. 
Their approach resulted in the replacement M50F which increased the T m , of subtilisin by 1 .8 °C. 
[0016] Steipe et al. (1994) proved on a variable domain of immunoglobulin that it is possible to predict a stabilizing 
mutation with better than 60% success rate just by using a statistical method which determines the most frequent 
amino acid residue at a certain position of this domain. It was also suggested that this method would provide useful 

35 results not only for stabilization of variable domains of antibodies but also for domains of other proteins. However, it 
was never mentioned that this method could be extended to the entire protein. Furthermore, nothing is said about the 
program which was used to calculate the frequency of amino acid residues at a distinct position or whether scoring 
matrices were used as in the present case. 

[0017] It is therefore an object of the present invention to provide a process for the preparation of a consensus protein 

40 comprising a process to calculate an amino acid residue for nearly all positions of a so-called consensus protein and 
to synthesize a complete gene from this sequence that could be expressed in a pro- or eukaryotic expression system. 
[001 8] DNA sequences of the present invention can be constructed starting from genomic or cDN A sequences coding 
for proteins, e.g. phytases known in the state of the art [for sequence information see references mentioned above, e. 
g. EP 684 313 or sequence data bases, for example like Genbank (Intelligenetics, California, USA), European Bioin- 

46 tormatics Institute (Hinston Hall, Cambridge, GB), NBRF (Georgetown University, Medical Centre, Washington DC, 
USA) and Vecbase (University of Wisconsin, Biotechnology Centre, Madison, Wisconsin, USA) or disclosed in the 
figures by methods of in vitro mutagenesis [see e.g. Sambrook et al., Molecular Cloning, Cold Spring Harbor Laboratory 
Press, New York]. A widely used strategy for such "site directed mutagenesis", as originally outlined by Hurchinson 
and Edgell [J. Virol. 8, 1 81 (1 971 )], involves the annealing of a synthetic oligonucleotide carrying the desired nucleotide 

so substitution to a target region of a single-stranded DNA sequence wherein the mutation should be introduced [for review 
see Smith, Anna Rev. Genet. 19, 423 (1985) and for improved methods see references 2-6 in Stanssen et al., Nucl. 
Acid Res., 4441-4454 (1989)]. Another possibility of mutating a given DNA sequence which is also preferred for 
the practiced the present invention is the mutagenesis by using the polymerase chain reaction (PCR). DNA as starting 
material can be isolated by methods known in the art and described e.g. in Sambrook et al. (Molecular Cloning) from 

55 the respective strains. For strain information see, e.g. EP 684 313 or any depository authority indicated below. As- 
pergillus niger [ATCC 9142], Myceliophthora thermophila [ATCC 48102], Talaromyces thermophilus [ATCC 201 86] and 
Aspergillus fumigatus [ATCC 34625] have been redeposited according to the conditions of the Budapest Treaty at the 
American Type Culture Cell Collection under the following accession numbers: ATCC 74337, ATCC 74340, ATCC 
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74338 and ATCC 74339, respectively. It is however, understood that DNA encoding a consensus protein in accordance 
with the present invention can also be prepared in a synthetic manner as described, e.g. in EP 747 483 or the examples 
by methods known in the art. 

[0019] Once complete DNA sequences of the present invention have been obtained they can be integrated into 
s vectors by methods known in the art and described e.g. in Sambrook et al. (s.a.) tooverexpress the encoded polypeptide 
in appropriate host systems. However, a man skilled in the art knows that also the DNA sequences themselves can 
be used to transform the suitable host systems of the invention to get overexpression of the encoded polypeptide. 
Appropriate host systems are for example fungi, like Aspergilli, e.g. Aspergillus niger [ATCC 91 42] or Aspergillus ficuum 
[NRRL 3135] or like Trichoderma, e.g. Trichoderma reesei or yeasts, like Saccharomyces, e.g. Saccharomyces cere- 
io visiae or Pichia, like Pichia pastoris, or Hansenula poiymorpha, e.g. H. polymorpha (DSM5215) or plants, as described, 
e.g. by Pen et al., Bio/Technology 11, 811-814 (1994). A man skilled in the art knows that such microorganisms are 
available from depository authorities, e.g. the American Type Culture Collection (ATCC), the Centraalbureau voor 
Schimmelcultures (CBS) or the Deutsche Sammlung fur Mikroorganismen und Zellkulturen GmbH (DSM) or any other 
depository authority as listed in the Journal "Industrial Property" [(1991)1, pages 29-40]. Bacteria which can be used 
is are e.g. E. coli, Bacilli as, e.g. Bacillus subtilis or Streptomyces, e.g. Streptomyces lividans (see e.g. Anne and Mallaert 
in FEMS Microbiol. Letters 114, 121 (1993). E. coli, which could be used are E. coli K12 strains e.g. M15 [described 
as DZ 291 by Villarejo et al. in J. Bacteriol. 120, 466-474 (1974)], HB 101 [ATCC No. 33694] or E. coli SG13009 
[Gottesman et al., J. Bacteriol. 148, 265-273 (1981)]. 

[0020] Vectors which can be used for expression in fungi are known in the art and described e.g. in EP 420 358, or 

20 by Cullen et al. [Bio/Technology 5, 369-376 (1 987)] or Ward in Molecular Industrial Mycology, Systems and Applications 
for Filamentous Fungi, Marcel Dekker, New York (1991), Upshall et al. [Bio/Technology 5, 1301-1304 (1987)] Gwynne 
et al. [Bio/Technology 5, 71-79 (1987)], Punt et al. [J. Biotechnol. 17, 19-34 (1991)] and for yeast by Sreekrishna et al. 
[J. Basic Microbiol. 28, 265-278 (1988), Biochemistry 2^ 4117-4125 (1989)], Hitzemann et al. [Nature 293, 717-722 
(1981)] or in EP 183 070, EP 183 071, EP 248 227, EP 263 311. Suitable vectors which can be used for expression 

25 in E. coli are mentioned, e.g. by Sambrook et al. [s.a.] or by Fiers et al. in Procd. 8th Int. Biotechnology Symposium" 
[Soc. Franc, de Microbiol., Paris (Durand et al., eds), pp. 680-697 (1 988)] or by Bujard et al. in Methods in Enzymology, 
eds. Wu and Grossmann, Academic Press, Inc. Vol. 155, 416-433 (19B7) and Stuber etai. in Immunological Methods, 
eds. Lefkovits and Pernis, Academic Press, Inc., Vol. IV, 121-152 (1990). Vectors which could be used for expression 
in Bacilli are known in the art and described, e.g. in EP 405 370, Procd. Natl. Acad. Sci. USA 81 , 439 (1 984) by Yansura 

30 and Henner, Meth. Enzymol. 185, 1 99-228 (1 990) or EP 207 459. Vectors which can be used for the expression in H. 
Polymorpha are known in the art and described, e.g. in Gellissen et al., Biotechnology 9, 291-295 (1991). 
[0021] Either such vectors already carry regulatory elements, e.g. promotors, or the DNA sequences of the present 
invention can be engineered to contain such elements. Suitable promotor elements which can be used are known in 
the art and are, e.g. for Trichoderma reesei the cbhl- [Haarki et al., Biotechnology 7, 596-600 (1989)] or the pki1-pro- 

35 motor [Schindler et al., Gene J30, 271-275 (1 993)], for Aspergillus oryzae the amy-promotor [Christensen et al., Abstr. 
19th Lunteren Lectures on Molecular Genetics F23 (1987), Christensen et al., Biotechnology 6, 1419-1422 (1988), 
Tadaetal., Mol. Gen. Genet. 229, 301 (1991)], for Aspergillus niger the glaA- [Cullen etal., Bio/Technology 5, 369-376 
(1987), Gwynne etal., Bio/Technology 5, 713-719 (1987), Ward in Molecular Industrial Mycology, Systems and Appli- 
cations for Filamentous Fungi, Marcel Dekker, New York, 83-106 (1991)], alcA- [Gwynne et al., Bio/Technology 5, 

40 718-719 (1987)], sud- [Boddy et al., Curr. Genet. 24, 60-66 (1993)], aphA- [MacRae et al., Gene 71, 339-348 (1988), 
MacRae et al. , Genel32, -1 93-1 98 (1 993)], tpiA-[McKnight et al. , Cell 46, 1 43-1 47 (1 986), Upshall et al. , Bio/Technology 
5, 1301-1304(1987)], gpdA- [Punt et al., Gene 69, 49-57 (1988), Punt et al., J. Biotechnol. 17, 19-37 (1991)] and the 
pkiA-promotor [de Graaff et al., Curr. Genet. 22, 21-27 (1992)]. Suitable promotor elements which could be used for 
expression in yeast are known in the art and are, e.g. the pho5-promotor [Vogel et al., Mol. Cell. Biol., 2050-2057 

45 (1989); Rudolf and Hinnen, Proc. Natl. Acad. Sci. 84, 1340-1344 (1987)] or the gap-promotor for expression in Sac- 
charomyces cerevisiae and for Pichia pastoris, e.g. the aoxl -promotor [Koutz et al., Yeast 5, 1 67-1 77 (1 989); Sreekrish- 
na et al., J. Basic Microbiol. 28, 265-278 (1988)], or the FMD promoter [Hollenberg et al., EPA No. 0299108] or MOX- 
promotor [Ledeboer et al., Nucleic Acids Res. _[3, 3063-3082 (1985)] for H. polymorpha. 

[0022] Accordingly vectors comprising DNA sequences of the present invention, preferably for the expression of said 
so DNA sequences in bacteria or a fungal or a yeast host and such transformed bacteria or fungal or yeast hosts are also 
an object of the present invention. 

[0023] It is also an object of the present invention to provide a system which allows for high expression of proteins, 
preferably phytases like the consensus phytase of the present invention in Hansenula characterized therein that the 
codons of the encoding DNA sequence of such a protein have been selected on the basis of a codon frequency table 
55 of the organism used for expression, e.g. yeast as in the present case (see e.g. in Example 3) and optionally the codons 
for the signal sequence have been selected in a manner as described for the specific case in Example 3. That means 
that a codon frequency table is prepared on the basis of the codons used in the DNA sequences which encode the 
amino acid sequences of the defined protein family. Then the codons for the design of the DNA sequence of the signal 
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sequence are selected from a codon frequency table of the host cell used for expression whereby always codons of 
comparable frequency in both tables are used. 

[0024] Once such DNA sequences have been expressed in an appropriate host cell in a suitable medium the encoded 
protein can be isolated either from the medium in the case the protein is secreted into the medium or from the host 

5 organism in case such protein is present intracellular^ by methods known in the art of protein purification or described 
in case of a phytase, e.g. in EP 420 358. Accordingly a process for the preparation of a polypeptide of the present 
invention characterized in that transformed bacteria or a host cell as described above is cultured under suitable culture 
conditions and the polypeptide is recovered therefrom and a polypeptide when produced by such a process or a 
polypeptide encoded by a DNA sequence of the present invention are also an object of the present invention. 

70 [0025] Once obtained the polypeptides of the present invent ion can be characterized regarding their properties which 
make them useful in agriculture any assay known in the art and described e.g. by Simons et al. [Br. J. Nutr. 6^ 525-540 
(1990)], Schoner et al. [J. Anim. Physiol, a. Anim. Nutr. 66, 248-255 (1991)], Vogt[Arch. Geflugelk. 56, 93-98 (1992)], 
Jongbloed et al. [J. Anim. Sci., 70, 1159-1168 (1992)], Perney et al. [Poultry Sci. 72, 2106-2114 (1993)], Farrell et al., 
[J. Anim. Physiol, a. Anim. Nutr. 69, 278-283 (1993), Broz et al.. (Br. Poultry Sci. 35, 273-280 (1994)] and Dungelhoef 

is et al. [Animal Feed Sci. Techno!. 49, 1-10 (1 994)] can be used. 

[0026] In general the polypeptides of the present invention can be used without being limited to a specific field of 
application, e.g. in case of phytases for the conversion of inositol polyphosphates, like phytate to inositol and inorganic 
phosphate. 

[0027] Furthermore the polypeptides of the present invention can be used in a process for the preparation of a 
20 pharmaceutical composition or compound food or feeds wherein the components of such a composition are mixed 
with one or more polypeptides of the present invention. Accordingly compound food or feeds or pharmaceutical com- 
positions comprising one or more polypeptides of the present invention are also an object of the present invention. A 
man skilled in the art is familiar with their process of preparation. Such pharmaceutical compositions or compound 
foods or feeds can further comprise additives or components generally used for such purpose and known in the state 
25 of the art. 

[0028] It is furthermore an object of the present invention to provide a process for the reduction of levels of phytate 
in animal manure characterized in that an animal is fed such a feed composition in an amount effective in converting 
phytate contained in the feedstuff to inositol and inorganic phosphate. 

[0029] Before describing the present invention in more detail a short explanation of the Tables and enclosed Figures 
30 is given below. 

[0030] Table 1: Vote weights of the amino acid sequences of the fungal phytases used. The table shows the vote 
weights used to calculate the consensus sequence of the fungal phytases. 

[0031] Table 2: Homology of the fungal phytases. The amino acid sequences of the phytases used in the alignment 
were compared by the program GAP (GCG program package, 9; Devereux et al, 1 984) using the standard parameters. 
35 The comparison was restricted to the part of the sequence that was also used for the alignment (see legend to Figure 
1 ) lacking the signal peptide which was rather divergent. The numbers above and beneath the diagonal represent the 
amino acid identities and similarities, respectively. 

[0032] Table 3: Homology of the amino acid sequence of fungal consensus phytase to the phytases used for its 
calculation. The amino acid sequences of all phytases were compared with the fungal consensus phytase sequence 
40 using the program GAP (GCG program package, 9.0). Again, the comparison was restricted to that part of the sequence 
that was used in the alignment. 

[0033] Table 4; Primers used for the introduction of single mutations into fungal consensus phytase. For the intro- 
duction of each mutation, two primers containing the desired mutation were required (see Example 8). The changed 
triplets are highlighted in bold letters. 

as [0034] Table 5: Temperature optimum and T m -value of fungal consensus phytase and ol the phytases from A. fumi- 
gatus, A. niger, A. nidulans, and M. thermophila. The temperature optima were taken from Figure 3. a The 7 m -values 
were determined by differential scanning calorimetry as described in Example 10 and shown in Figure 7. 
[0035] Figure 1: Calculation of the consensus phytase sequence from the alignment of nearly all known fungal 
phytase amino acid sequences. The letters represent the amino acid residues in the one-letter code. The following 

50 sequences were used for the alignment: phyA from Aspergillus terreus 9A-1 (Mitchell et al, 1997; from amino acid 
(aa) 27), phyAUom Aspergillus te/reuscbs1 16.46 (van Loon et al., 1997; from aa 27), pnyyAfrom Aspergillus nigervar. 
awamori (Piddington et al, 1993; from aa 27), phyA from Aspergillus n/oer T21 3; from aa 27), phyA from Aspergillus 
niger strain NRRL3135 (van Hartingsveldt er al, 1993; from aa 27), phyA from Aspergillus fumigatus ATCC 13073 
(Pasamontes et al, 1997b; from aa 25), phyA from Aspergillus fumigatus ATCC 32722 (van Loon et al., 1997; from 

55 aa 27), phyA from Aspergillus fumigatus ATCC 58128 (van Loon et al, 1997; from aa 27), phyA from Aspergillus 
fumigatus ATCC 26906 (van Loon et al , 1 997; from aa 27), phyA from Aspergillus fumigatus ATCC 32239 (van Loon 
et al, 1 997; from aa 30), phyA from Aspergillus nidulans (Pasamontes et al, 1 997a; from aa 25), phyA from Talaro- 
myces thermophilus (Pasamontes et al, 1997a; from aa 24), and phyA from Myceliophthora thermophila (Mitchell et 
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al, 1997; from aa 19). The alignment was calculated using the program PILEUP. The location of the gaps was refined 
by hand. Capitalized amino acid residues in the alignment at a given position belong to the amino acid coalition that 
establish the consensus residue. In bold, beneath the calculated consensus sequence, the amino acid sequence of 
the finally constructed fungal consensus phytase (Fcp) is shown. The gaps in the calculated consensus sequence 

5 were filled by hand according to principals stated in Example 2. 

[0036] Figure 2: DNA sequence of the fungal consensus phytase gene (fcp) and of the primers synthesized for gene 
construction. The calculated amino acid sequence (Figure 1) was converted into a DNA sequence using the program 
BACKTRANSLATE (Devereux et ai, 1984) and the codon frequency table of highly expressed yeast genes (GCG 
program package, 9.0). The signal peptide of the phytase from A terreus cbs was fused to the N-terminus. The bold 

io bases represent the sequences of the oligonucleotides used to generate the gene. The names of the respective oli- 
gonucleotides are noted above or below the sequence. The underlined bases represent the start and stop codon of 
the gene. The bases written in italics show the two introduced Eco Rl sites. 

[0037] Figure 3 : Temperature optimum of fungal consensus phytase and other phytases used to calculate the con- 
sensus sequence. For the determination of the temperature optimum, the phytase standard assay was performed at 
is a series of temperatures between 37 and 85 °C. The phytases used were purified according to Example 5. V, fungal 
consensus phytase; T, A fumigatus 13073 phytase; A. niger NRRL31 35 phytase; O, A. nidulans phytase; ■, A. 
ferreus9A-1 phytase; A. terreus cbs phytase. 

[0038] Figure 4 ; The pH-dependent activity profile of fungal consensus phytase and of the mutant Q50L, Q50T, and 
Q50G. The phytase activity was determined using the standard assay in appropriate buffers (see Example 9) at different 
20 pH-values. Plot a) shows a comparison of fungal consensus phytase (•) to the mutants Q50L (V), Q50T (T), and 
Q50G (O) in percent activity. Plot b) shows a comparison of fungal consensus phytase (O) to mutant Q50L (•) and 
Q50T (V) using the specific activity of the purified enzymes expressed in H. polymorphs. 

[0039] Figure 5 : The pH-dependent activity profile of the mutants Q50L, Y51N and Q50T, Y51N in comparison to 
the mutants Q50T and Q50L of fungal consensus phytase. The phytase activity was determined using the standard 

25 assay in appropriate buffers (see Example 9) at different pH-values. Graph a) shows the influence of the mutation 
Y51N (•) on mutant Q50L (O). Graph b) shows the influence of the same mutation (•) on mutant Q50T (O). 
[0040] Figure 6 : Substrate specificity of fungal consensus phytase and its mutants Q50L, Q50T, and Q50G. The bars 
represent the relative activity in comparison to the activity with phytic acid (100%) with a variety of known natural and 
synthetic phosphoryiated compounds. 

30 [0041] Figure 7 : Differential scanning calorimetry (DSC) of fungal consensus phytase and its mutant Q50T. The 
protein samples were concentrated to ca. 50-60 mg/ml and extensively dialyzed against 10 mM sodium acetate, pH 
5.0. A constant heating rate of 10 °C/min was applied up to 90 °C. DSC of consensus phytase Q50T (upper graph) 
yielded in a melting temperature of 78.9 °C, which is nearly identical to the melting point of fungal consensus phytase 
(78.1 °C, lower graph). 

35 

Examples 

Reference Example 

40 Homology Modeling of A fumigatus and A terreus cbsl 16.46 phytase 

[0042] The amino acid sequences of A. fumigatus and A. terreus cbsl 16.46 phytase were compared with the se- 
quence of A niger NRRL 3135 phytase (see Figure 1) for which the three-dimensional structure had been determined 
by X-ray crystallography. 

45 [0043] A multiple amino acid sequence alignment of A. n/ger NRRL 31 35 phytase, A fumigatus phytase and A terreus 
cbsl 16.46 phytase was calculated with the program "PILEUP" (Prog. Menu for the Wisconsin Package, version 8, 
September 1994, Genetics Computer Group. 575 Science Drive, Madison Wisconcin, USA 53711). The three-dimen- 
sional models of A fumigatus phytase and A terreus cbsl 16.46 phytase were built by using the structure of A. niger 
NRRL 3135 phytase as template and exchanging the amino acids of A rwger NRRL 3135 phytase according to the 

50 sequence alignment to amino acids of A fumigatus and A. terreus cbs 11 6. 46 phytases, respectively. Model construction 
and energy optimization were performed by using the program Moloc (Gerber and Muller, 1995). C-alpha positions 
were kept fixed except for new insertions/deletions and in loop regions distant from the active site. 
[0044] Only small differences of the modelled structures to the original crystal structure could be observed in external 
loops. Furthermore the different substrate molecules that mainly occur on the degradation pathway of phytic acid (myo- 

55 inositol -hexakisphosphate) by Pseudomonas sp. bacterium phytase and, as far as determined, by A. niger NRRL 31 35 
phytase (Cosgrove, 1980) were constructed and forged into the active site cavity of each phytase structure. Each of 
these substrates was oriented in a hypothetical binding mode proposed for histidine acid phosphatases (van Etten, 
1982). The scissile phosphate group was oriented towards the catalytically essential His 59 to form the covalent phos- 
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phoenzyme Intermediate. The oxygen of the substrate phosphoester bond which will be protonated by Asp 339 after 
cleavage was orientated towards the proton donor. Conformational relaxation of the remaining structural part of the 
substrates as well as the surrounding active site residues was performed by energy optimization with the program 
Moloc. 

5 [0045] Based on the structure models the residues pointing into the active site cavity were identified. More than half 
(60%) of these positions were identical between these three phytases, whereas only few positions were not conserved 
(see Figure 1 ). This observation could be extended to four additional phytase sequences (A nidutans, A terreus 9A1 , 
Talaromyces thermophilus, Myceliophthora thermophila). 

io Example 1 

Alignment of the amino acid sequence of the fungal phytases 

[0046] The alignment was calculated using the program PILEUP from the Sequence Analysis Package Release 9.0 
is (Devereux et at., 1 984) with the standard parameter (gap creation penalty 1 2, gap extension penalty 4). The location 
of the gaps was refined using a text editor. The following sequences (see Figure 1 ) without the signal sequence were 
used for the performance of the alignment starting with the amino acid (aa) mentioned below: 

phyA gene from Aspergillus terreus 9A-1 , aa 27 (Mitchell et at., 1997) 
20 phyA gene from Aspergillus terreus cbs 116.46, aa 27 (van Loon et at, 1 997) 

phyA gene from Aspergillus nigervar. awamori, aa 27 (Piddington et al, 1993) 
phyA gene from Aspergillus nigerT2*\3 t aa 27 

phyA gene from Aspergillus niger strain NRRL3135, aa 27 (van Hartingsveldt et al, 1993) 

phyA gene from Aspergillus fumigatus ATCC 13073, aa 26 (Pasamontes et al, 1997) 
25 phyA gene from Aspergillus fumigatus ATCC 32722, aa 26 (van Loon et al, 1 997) 

phyA gene from Aspergillus fumigatus ATCC 581 28, aa 26 (van Loon et al., 1 997) 

phyA gene from Aspergillus fumigatus ATCC 26906, aa 26 (van Loon et al, 1 997) 

phyA gene from Aspergillus fumigatus ATCC 32239, aa 30 (van Loon et al, 1 997) 

phyA gene from Aspergillus nidulans, aa 25 (Roche Nr. R1288, Pasamontes et al, 1997a) 
30 phyA gene from Talaromyces thermophilus ATCC 20186, aa 24 (Pasamontes et al, 1 997a) 

phyA gene from Myceliophthora thermophila, aa 19 (Mitchell et al, 1997) 

[0047] Table 2 shows the homology of the phytase sequences mentioned above. 
35 Example 2 

Calculation of the amino acid sequence of fungal consensus phytases 

[0048] Using the refined alignment of Example 1 as input, the consensus sequence was calculated by the program 
40 PRETTY from the Sequence Analysis Package Release 9.0 (Devereux et al, 1984). PRETTY prints sequences with 
their columns aligned and can display a consensus sequence for the alignment. A vote weight that pays regard to the 
similarity between the amino acid sequences of the phytases aligned were assigned to all sequences. The vote weight 
was set such as the combined impact of all phytases from one sequence subgroup (same species of origin but different 
strains), e. g. the amino acid sequences of all phytases from A fumigatus, on the election was set one, that means 
45 that each sequence contributes with a value of 1 divided by the number of strain sequences (see Table 1 ). By this 
means, it was possible to prevent that very similar amino acid sequences, e. g. of the phytases from different A fumi- 
gatus strains, dominate the calculated consensus sequence. 

[0049] The program PRETTY was started with the following parameters: The plurality defining the number of votes 
below which there is no consensus was set on 2.0. The threshold, which determines the scoring matrix value below 
50 which an amino acid residue may not vote for a coalition of residues, was set on 2. PRETTY used the PrettyPep.Cmp 
consensus scoring matrix for peptides. 

[0050] Ten positions of the alignment (position 46, 66, 82, 138, 162, 236. 276, 279, 280, 308; Figure 1), for which 
the program was not able to determine a consensus residue, were filled by hand according to the following rules: if a 
most frequent residue existed, this residue was chosen (1 38, 236, 280); if a prevalent group of chemically similar or 
55 equivalent residues occurred, the most frequent or, if not available, one residues of this group was selected (46, 66. 
82, 162, 276, 308). If there was either a prevalent residue nor a prevalent group, one of the occurring residues was 
chosen according to common assumption on their influence on the protein stability (279). Eight other positions (132, 
170, 204, 211, 275, 317, 384, 447; Figure 1) were not filled with the amino acid residue selected by the program but 
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normally with amino acids that occur with the same frequency as the residues that were chosen by the program. In 
most cases, the slight underrating of the three A niger sequences (sum of the vote weights: 0.99) was eliminated by 
this corrections. 

[0051] Table 3 shows the homology of the calculated fungal consensus phytase amino acid sequence to the phytase 
5 sequences used for the calculation. 

Example 3 

Conversion of the fungal consensus phytase amino acid sequence to a DNA sequence 

10 

[0052] The first 26 amino acid residues of A. terreus cbsl 1 6.46 phytase were used as signal peptide and, therefore, 
fused to the N-terminus of all consensus phytases. For this stretch, we used a special method to calculate the corre- 
sponding DNA sequence. Purvis etai (1987) proposed that the incorporation of rare codons in a gene has an influence 
on the folding efficiency of the protein. Therefore, at least the distribution of rare codons in the signal sequence of A 

1$ terreus cbs11 6.46, which was used for the fungal consensus phytase and which is very important for secretion of the 
protein, but converted into the S. cerevisiae codon usage, was transferred into the new signal sequence generated for 
expression in S. cerevisiae. For the remaining parts of the protein, we used the codon frequency table of highly ex- 
pressed S. cerevisiae genes, obtained from the GCG program package, to translate the calculated amino acid sequence 
into a DNA sequence. 

20 [0053] The resulting sequence of the fcp gene are shown in Figure 2. 

Example 4 

Construction and cloning of the fungal consensus phytase genes 

25 

[0054] The calculated DNA sequence of fungal consensus phytase was divided into oligonucleotides of 85 bp, al- 
ternately using the sequence of the sense and the anti-sense strand. Every oligonucleotide overlaps 20 bp with its 
previous and its following oligonucleotide of the opposite strand. The location of all primers, purchased by Microsynth, 
Balgach (Switzerland) and obtained in a PAGE-purified form, is indicated in Figure 2. 
30 [0055] In three PGR reactions, the synthesized oligonucleotides were composed to the entire gene. For the PCR, 
the High Fidelity Kit from Boehringer Mannheim (Boehringer Mannheim, Mannheim, Germany) and the thermo cycler 
The Protokol™ from AMS Biotechnology (Europe) Ltd. (Lugano, Switzerland) were used. 

[0056] Oligonucleotide CP-1 to CP-10 (Mix 1 , Figure 2) were mixed to a concentration of 0.2 pMol/u.1 per each oli- 
gonucleotide. A second oligonucleotide mixture (Mix 2) was prepared with CP-9 to CP-22 (0.2 pMol/uJ per each oligo- 
35 nucleotide). Additionally, four short primers were used in the PCR reactions: 

CP-a: Eco RI 

40 o-TAT ATG AAT TCA TGG GCG TGT TCG TC-3' 



CP-b: 

5'-TGA AAA GTT CAT TGA AGG TTT C-3' 

CP-c: 

5'-TCT TCG AAA GCA GTA CAA GTA C-3' 
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CP-e: 



EcoKL 

5*-TAT ATGAATTCT TAA GCG AAA C-3' 



PCR reaction a: 



10 



is PCR reaction b: 



20 



25 



30 



40 



10 p.l Mix 1 (2.0 pmol of each oligonucleotide) 
2 uJ nucleotides (10 mM each nucleotide) 
2 uJ primer CP-a (10 pmol/uJ) 
2 uJ primer CP-c (10 pmol/uJ) 
10,0 |il PCR buffer 
0.75 u.l polymerase mixture 
73.25 u.l H 2 0 

10 |il Mix 2 (2.0 pmol of each oligonucleotide) 
2 u.l nucleotides (10 mM each nucleotide) 
2 uJ primer CP-b (10 pmol/uJ) 
2 uJ primer CP-e (10 pmol/uJ) 
10,0 uJ PCR buffer 
0.75 jil polymerase mixture (2.6 U) 
73.25 uJ H 2 0 

step 1 2 min - 45°C 
step2 30sec-72°C 
step 3 30 sec - 94° C 
step 4 30 sec - 52° C 
step 5 1 min - 72°C 

Step 3 to 5 were repeated 40-times. 

[0057] The PCR products (670 and 905 bp) were purified by an agarose gel electrophoresis (0.9% agarose) and a 
following gel extraction (QIAEX II Gel Extraction Kit, Qiagen, Hilden, Germany). The purified DNA fragments were 
used for the PCR reaction c. 



Reaction conditions for PCR reaction a and b: 



35 PCR reaction c. 



Reaction conditions for PCR reaction c. 



45 



6 u.l PCR product of reaction a (=50 ng) 

6 u.l PCR product of reaction b (=50 ng) 

2 jil primer CP-a (10 pmol/uJ) 

2 uJ primer CP-e (10 pmol/uJ) 

10,0 uJ PCR buffer 

0.75 jil polymerase mixture (2.6 U) 

73.25 u.l H 2 0 

step 1 2 min - 94°C 

step 2 30 sec - 94°C 

step 3 30 sec - 55° C 

step 4 1 min - 72°C 



so 



Step 2 to 4 were repeated 31 -times. 

[0058] The resulting PCR product (1.4 kb) was purified as mentioned above, digested with Eco Rl, and ligated in an 
Eco Rl-digested and dephosphorylated pBsk(-)-vector (Stratagene, La Jolla, C A, USA). 1 uJ of the ligation mixture was 
used to transform E. coli XL-1 competent cells (Stratagene, La Jolla, CA, USA). All standard procedures were carried 
out as described by Sambrook et ai (1987). The constructed fungal consensus phytase gene {fcp) was verified by 
sequencing (plasmid pBsk-fcp). 



55 
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Example 5 

Expression of the fungal consensus phytase gene fcp and its variants in Saccharomyces cerevisiae and their 
purification from culture supernatant 

5 

[0059] A fungal consensus phytase gene was isolated from the plasmid pBsk-fcp ligated into the Eco Rl sites of the 
expression cassette of the Saccharomyces cerevisiae expression vector pYES2 (Invitrogen, San Diego, CA, USA) or 
subcloned between the shortened GAPFL (glyceraldhyde-3-phosphate dehydrogenase) promoter and the pho5 ter- 
minator as described by Janes et al. (1 990). The correct orientation of the gene was checked by PCR. Transformation 

io of S. cerevisiae strains, e. g. INVSd (Invitrogen, San Diego, CA, USA) was done according to Hinnen et al. (1978). 
Single colonies harboring the phytase gene under the control of the GAPFL promoter were picked and cultivated in 5 
ml selection medium (SD-uracil, Sherman et al, 1986) at 30°C under vigorous shaking (250 rpm) for one day. The 
preculture was then added to 500 ml YPD medium (Sherman et al., 1986) and grown under the same conditions. 
Induction of the gall promoter was done according to manufacturer's instruction. After four days of incubation cell broth 
was centrifuged (7000 rpm, GS3 rotor, 15 min, 5°C) to remove the cells and the supernatant was concentrated by way 
of ultrafiltration in Amicon 8400 cells (PM30 membranes) and ultrafree-15 centrifugal filter devices (Biomax-30K, Mil- 
lipore, Bedford, MA, USA). The concentrate (10 ml) was desalted on a 40 ml Sephadex G25 Superfine column (Phar- 
macia Biotech, Freiburg, Germany), with 1 0 mM sodium acetate, pH 5.0, serving as elution buffer. The desalted sample 
was brought to 2 M (NH 4 ) 2 S0 4 and directly loaded onto a 1 ml Butyl Sepharose 4 Fast Flow hydrophobic interaction 

20 chromatography column (Pharmacia Biotech, Feiburg, Germany) which was eluted with a linear gradient from 2 M to 
0 M (NH4) 2 S0 4 in 10 mM sodium acetate, pH 5.0. Phytase was eluted in the break-through, concentrated and loaded 
on a 1 20 ml Sephacryl S-300 gel permeation chromatography column (Pharmacia Biotech, Freiburg, Germany). Fungal 
consensus phytase and fungal consensus phytase 7 eluted as a homogeneous symmetrical peak and was shown by 
SDS-PAGE to be approx. 95% pure. 

25 

Example 6 

Expression of the fungal consensus phytase genes fcp and its variants in Hansenula polvmorpha 

30 [0060] The phytase expression vectors, used to transform H. polymorpha, was constructed by inserting the Eco Rl 
fragment of pBsk-fcp encoding the consensus phytase or a variant into the multiple cloning site of the H. polymorpha 
expression vector pFPMT121, which is based on an ura3 selection marker and the FMD promoter. The 5* end of the 
fcp gene is fused to the FMD promoter, the 3' end to the MOX terminator (Gellissen et al., 1996; EP 0299 108 B). The 
resulting expression vector are designated pFPMTfcp and pBsk' fcp'7. 

35 [0061] The constructed plasmids were propagated in E. coli. Plasmid DNA was purified using standard state of the 
art procedures. The expression plasmids were transformed into the H. polymorpha strain RP11 deficient in orotidine- 
5'-phosphate decarboxylase (ura3) using the procedure for preparation of competent cells and for transformation of 
yeast as described in Gelissen et al. (1996). Each transformation mixture was plated on YNB (0.14% w/v Difco YNB 
and 0.5% ammonium sulfate) containing 2% glucose and 1 .8% agar and incubated at 37 *C. After 4 to 5 days individual 

40 transformant colonies were picked and grown in the liquid medium described above for 2 days at 37 °C. Subsequently, 
an aliquot of this culture was used to inoculate fresh vials with YNB-medium containing 2% glucose. After seven further 
passages in selective medium, the expression vector integrates into the yeast genome in multimeric form. Subsequent- 
ly, mitotically stable transformants were obtained by two additional cultivation steps in 3 ml non-selective liquid medium 
(YPD, 2% glucose, 10 g yeast extract, and 20 g peptone). In order to obtain genetically homogeneous recombinant 

is strains an aliquot from the last stabilization culture was plated on a selective plate. Single colonies were isolated for 
analysis of phytase expression in YNB containing 2% glycerol instead of glucose to derepress the fmd promoter. Pu- 
rification of the fungal consensus phytases was done as described in Example 5. 

Example 7 

so 

Expression of the fungal consensus genes fcp and its variants in Aspergillus niper 

[0062] Plasmid pBskVcp or the corresponding plasmid of a variant of the fcp gene were used as template for the 
introduction of a Bsp HI -site upstream of the start codon of the genes and an Eco RV-site downstream of the stop 
55 codon. The Expand™ High Fidelity PCR Kit (Boehringer Mannheim, Mannheim, Germany) was used with the following 
primers: 
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Primer Asp-1: 

BspHL 

5'-TAT ATC ATG AGC GTG TTC GTC GTG CTA CTG TTC-3' 



Primer Asp-2 for cloning of fcp and fcpl: 

3'-ACC CGA CTT ACA AAG CGA ATT CTA TAG ATA TAT- 5' 

EcoRV 

75 

[0063] The reaction was performed as described by the supplier. The PCR-amplified fcp gene had a new Bsp HI site 
at the start codon, introduced by primer Asp-1, which resulted in a replacement of the second amino acid residue 
glycine by serine. Subsequently, the DNA-fragment was digested with Bsp HI and Eco RV and ligated into the Nco I 

20 site downstream of the glucoamylase promoter of Aspergillus niger (glaA) and the Eco RV site upstream of the As- 
pergillus nidulans tryptophan C terminator (trpC) (Mullaney et at, 1985). After this cloning step, the genes were se- 
quenced to detect possible failures introduced by PCR. The resulting expression plasmids which basically corresponds 
to the pGLAC vector as described in Example 9 of EP 684 313, contained the orotidine-5'-phosphate decarboxylase 
gene (pyr4) of Neurospora crassa as a selection marker. Transformation of Aspergillus niger and expression of the 

25 consensus phytase genes was done as described in EP 684 313. The fungal consensus phytases were purified as 
described in Example 5. 

Example 8 

30 Construction of muteins of fungal consensus phytase 

[0064] To construct muteins for expression in A. niger, S. cerevisiae, or H polymorpha, the corresponding expression 
plasmid containing the fungal consensus phytase gene was used as template for site-directed mutagenesis. Mutations 
were introduced using the "quick exchange™ site-directed mutagenesis kit" from Stratagene ( La Jolla, CA, USA) 
35 following the manufacturer's protocol and using the corresponding primers. All mutations made and the corresponding 
primers are summarized in Table 4. Clones harboring the desired mutation were identified by DNA sequence analysis 
as known in the art. The mutated phytase were verified by sequencing of the complete gene. 

Example 9 

40 

. Determination of the phytase activity and of the temperature optimum of the consensus phytase and its variants 

[0065] Phytase activity was determined basically as described by Mitchell et al. (1997). The activity was measured 
in a assay mixture containing 0.5% phytic acid (=5 mM), 200 mM sodium acetate, pH 5.0. After 15 min incubation at 

45 37 °C, the reaction was stopped by addition ot an equal volume of 15% trichloroacetic acid. The liberated phosphate 
was quantified by mixing 100 uJ of the assay mixture with 900 uJ H z O and 1 ml Of 0.6 M H 2 S0 4 , 2% ascorbic acid and 
0.5% ammonium molybdate. Standard solutions of potassium phosphate were used as reference. One unit of enzyme 
activity was defined as the amount of enzyme that releases 1 ujnol phosphate per minute at 37 °C. The protein con- 
centration was determined using the enzyme extinction coefficient at 280 nm calculated according to Pace el al. (1 995): 

so fungal consensus phytase, 1.101; fungal consensus phytase 7, 1 .068. 

[0066] In case of pH-optimum curves, purified enzymes were diluted in 10 mM sodium acetate, pH 5.0. Incubations 
were started by mixing aliquots of the diluted protein with an equal volume of 1% phytic acid (=10 mM) in a series of 
different buffers: 0.4 M gfycine/HCl, pH 2.5; 0.4 M acetate/NaOH, pH 3.0, 3.5, 4.0, 4.5, 5.0, 5.5; 0.4 M imidazole/HCI, 
pH 6.0, 6.5; 0.4 M Tris/HCI pH 7.0, 7.5, 8.0, 8.5, 9.0. Control experiments showed that pH was only slightly affected 

ss by the mixing step. Incubations were performed for 15 min at 37 °C as described above. 

[0067] For determination of the substrate specificities of the phytases, phytic acid in the assay mixture was replaced 
by 5 mM concentrations of the respective phosphate compounds. The activity tests were performed as described above. 
[0068] For determination of the temperature optimum, enzyme (100 uJ) and substrate solution (100 were pre- 
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incubated tor 5 min at the given temperature. The reaction was started by addition of the substrate solution to the 
enzyme. After 1 5 min incubation, the reaction was stopped with trichloroacetic acid and the amount of phosphate 
released was determined. 

[0069] The pH-optimum of the original fungal consensus phytase was around pH 6.0-6.5 (70 U/mg). By introduction 
5 of the Q50T mutation, the pH-optimum shifted to pH 6.0 (130 U/mg), while the replacement by a leucine at the same 
position resulted in a maximum activity around pH 5.5 (212 U/mg). The exchange Q50G resulted in a pH-optimum of 
the activity above pH 6.0 (see Figure 4). The exchange of tyrosine at position 51 with asparagine resulted in a relative 
increase of the activity below pH 5.0 (see Figure 5). Especially by the Q50L mutation, the specificity for phytate of 
. fungal consensus phytase was drastically increased (see Figure 6). 
io [0070] The temperature optimum of fungal consensus phytase (70 °C) was 15-25 °C higher than the temperature 
optimum of the wild-type phytases (4555 °C) which were used to calculate the consensus sequence (see Table 5 and 
Figure 3). 

Example 10 

75 

Determination of the melting point by differential scanning calorimetry (DSC) 

[0071] In order to determine the unfolding temperature of the fungal consensus phytases, differential scanning ca- 
lorimetry was applied as previously published by Brugger etal. (1 997). Solutions of 50-60 mg/ml homogeneous phytase 
20 were used for the tests. A constant heating rate of 10 °C/min was applied up to 90 °C. 

[0072] The determined melting points clearly show the strongly improved thermostability of the fungal consensus 
phytase in comparison to the wild-type phytases (see Table 5 and Figure 7). Figure 7 shows the melting profile of fungal 
consensus phytase and its mutant Q50T. Its common melting point was determined between 78 to 79 °C. 
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Table 1 



Aspergillus terreus cbs'\ 16.46 phytase 
Aspergillus nigervat. awamori phytase 
Aspergillus niger 12M phytase 



Aspergillus terreus 9A-1 phytase 



0.50 
0.50 



55 



0.3333 
0.3333 
0.3333 



Aspergillus n/perNRRL3135 phytase 
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Table 1 (continued) 



Aspergillus fumigatus ATCC 1 3073 phytase 


i 

> 
• 


0,20 


Aspergillus fumigatus ATCC 32722 phytase 


i 

: 

i 

i 

• 
■ 
< 
• 

! 


0.20 


Aspergillus fumigatus ATCC 581 28 phytase 


0.20 


Aspergillus fumigatus ATCC 26906 phytase 


J 

■ 

• 
• 
• 


0.20 


Aspergillus fumigatus ATCC 32239 phytase 


• 

i 

■ 
■ 


0.20 


Aspergillus nidulans phytase 


i 

■ 

i 

1 


1.00 


Talaromyces thermophilus ATCC 20186 phytase 


1.00 


Myceliophthora thermophila phytase 




1.00 
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Table 2 



% identity 





A 

terreus 
9A-1 


A terreus 
cbsll6.46 


A niger 

NRRL 

3135 


A 

fumiga- 
tus 

13073 


A 

nidulans 


T. 

thermo- 
philic 


M. ther- 
mophila 


A terreus 
9A-1 




89.1 


62.0 


60.6 


59.3 


58.3 


48.6 


A terreus 
cbs 


90.7 




63.6 


62.0 


61.2 


59.7 


49.1 


A niger 

NRRL 

3135 


67.3 


68.9 


• 


66.8 


64.2 


62.5 


49.4 


A 

fumiga- 
tus 

13073 


66.1 


67.2 


71.1 




68.0 


62.6 


53.0 


A 

nidulans 


65.0 


66.7 


69.0 


73.3 


' i .» 
i 


60.5 


52.5 


T. 

thermo- 
philus 


63.8 


64.5 


68.9 


68.1 


67.4 




49.8 


M. ther- 
mophila 


53.7 


54.6 


57.6 


61.0 


59.9 


57.8 





% similarity 



Table 3: 



Phytase 


Identity |%] 


Similarity [%] 


A niger T213 


76.5 


79.6 


A. niger var. awamori 


76.6 


79.6 


A. niger NRRL31 35 


76.6 


79.4 
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Table 3 


: (continued) 




Phytase 


Identity [%] 


Similarity [%] 


A. nidulans 


77.4 


81.5 


A. terreus 9A-1 


70.7 


74.8 


A. terreus cbsl 16.46 


72.1 


75.9 


/A. fumigatus 1 3073 


80.0 


83.9 


A fumigatus 32239 


78.2 


82.3 


7? thermophilus 


72.7 


76.8 


M. thermophila 


58.3 


64.5 



IS 



so 



25 



Table 4 

mutation Primer set 

Ssp BI 

Q50L 5'-CAC TTG TGG GGT TTGTACAGT CCA TAC TTC TC-3' 

5'-GAG AAG TAT GGA CTG TAC AAA CCC CAC AAG TG-3' 

Kpnl 

Q50T 5'-CAC TTG TGG GGT ACC TAC TCT CCA TAC TTC TC-3' 

5'-GA GAA GTA TGG AGA GTA GGT ACC CCA CAA GTG-3' 



30 



Q50G 5'-CAC TTG TGG GGT GGT TAC TCT CCA TAC TTC TC-3' 

5'-GA GAA GTA TGG AGA GTA ACC ACC CCA CAA GTG-3' 



3S 



Kpnl 

Q50T-Y51N 5'-CAC TTG TGG GGT ACC AAC TCT CCA TAC TTC TC-3' 

40 5'-GA GAA GTA TGG AGA GTT GGT ACC CCA CAA GTG-3' 

Bsal 

Q50L-Y51N 5'-CAC TTG TGG GGT CTC AAC TCT CCA TAC TTC TC-3' 

5'-GA GAA GTA TGG AGA GTT GAG ACC CCA CAA GTG-3' 



Table 5 



phytase 


temperature optimum 


7m a 


Consensus phytase 


70 °C 


78.0 °C 


A. n/perNRRL3135 


55°C 


63.3°C 


A. fumigatus ^ 307 3 


55°C 


62.5°C 


A. terreus 9A-1 


49°C 


57.5°C 


A. terreus cbs 


45°C 


58.5°C 
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Table 5 (continued) 



phytase 


temperature optimum 


7m a 


A. nidulans 
M. thermophila 


45°C 
55 °C 


55.7 °C 

• 



Claims 

jo 1 . A process tor the preparation of a consensus protein, whereby such process is characterized by the following steps: 

a) at least three, preferably four amino acid sequences are aligned by any standard alignment program known 
in the art; 

7 5 b) amino acids at the same position according to such alignment are compared regarding their evolutionary 

similarity by any standard program known in the art, whereas the degree of similarity provided by such a 
program which defines the least similarity of the amino acids that is used for the determination of an amino 
acid of corresponding positions is set to a less stringent number and the parameters are set in such a way 
that it is possible for the program to determine from only 2 identical amino acids at a corresponding position 

20 an amino acid for the consensus protein; however, if among the compared amino acid sequences are se- 

quences that show a much higher degree of similarity to each other than to the residual sequences, these 
sequences are represented by their consensus sequence determined as defined in the same way as in the 
present process for the consensus sequence of the consensus protein or a vote weight of 1 divided by the 
number of such sequences is assigned to every of those sequences. 

25 

c) in case no common amino acid at a defined position is identified by the program, any of the amino acids, 
preferably the most frequent amino acid of all such sequences is selected; 

d) once the consensus sequence has been defined, such sequence is back-translated into a DNA sequence, 
30 preferably by using a codon frequency table of the organism in which expression should take place; 

e) the DNA sequence is synthesized by methods known in the art and used either integrated into a suitable 
expression vector or by itself to transform an appropriate host cell; 

3S f) the transformed host cell is grown under suitable culture conditions and the consensus protein is isolated 

from the host cell or its culture medium by methods known in the art. 

2. A process as claimed in claim 1 wherein the program used for the comparison of amino acids at a defined position 
regarding their evolutionary similarity is the program "PRETTY". 

40 

3. A process as claimed in claim 1 or 2, wherein the defined protein family is the family of phytases. 

4. A process as claimed in claim 3, wherein the phytases are of fungal origin. 

45 5. A process as claimed in any one of claims 1 to 4, wherein the host cell is of eukaryotic origin. 

6, A process as claimed in claim 5, wherein eukaryotic means fungal, preferably Aspergillus or yeast, preferably 
Saccharomyces or Hansenula. 

so 7. A consensus protein obtainable, preferably obtained by a process as claimed in any one of claims 1 to 6. 

8. A consensus protein which has the amino acid sequence shown in Figure 2 or any variants or muteins thereof. 

9. A mutein of the consensus protein of claim 8 characterized therein that in the amino acid sequence of Figure 2 
55 the following replacements have been effected Q50L, Q50T, Q50G, Q50T-Y51 N or Q50L-Q51 N. 

10. A food, feed or pharmaceutical composition comprising a consensus protein as claimed in any of the claims 7 to 9. 



18 



EP 0 897 985 A2 



Figure 1/1 



1 50 

A. terreus 9A-1- KhsDCNSVDh GYQCFPELSH kWGlYAPYFS LQDESPFPLD VPEDChlTFV 

A. terreus Cb3 NhsDCTSVDr GYQCFPELSH kWGlYAPYFS LQDESPFP1D VPDDChlTFV 

A. niger var. awamori NqsTCDTVDQ GYQCFSETSH LWGQYAPFFS LANESAISPD VPAGCrVTFA 

A. niger T213 NqsSCDTVDQ GYQCFSETSH LWGQYAPFFS LANESVISPO VPAGCrVTFA 

A. niger NRRL3135 NqsSCDTVDQ GYQCFSETSH LWGQYAPFFS LANESVISPE VPAGCrVTFA 

A. fumigatus 13073 GSkSCDTVOX GYQCsPATSH LWGQYSPFFS LEDE1SVSSK LPKDCrlTLV 

A. fumigatus 32722 GSkSCDTVDl GYQCsPATSH LWGQYSPFFS LEDE1SVSSK LPKDCrlTLV 

A. fumigatus 58128 GSkSCDTVDl GYQCsPATSH LWGQYSPFFS LEDE1SVSSK LPKDCrlTLV 

A. fumigatus 26906 GSkSCDTVDl GYQCsPATSH LWGQYSPFFS LEDE1SVSSK LPKDCrlTLV 

A. fumigatus 32239 GSkACDTVEl GYQCsPGTSH LWGQYSPFFS LEDE1SVSSD LPKDCrVTFV 

A. nidulans QNHSCNTADG GYQCFPNVSH VWGQYSPYFS IEQESAISeD VPHGCeVTFV 

T. thermophilus DSHSCNTVEG GYQCrPEISH sWGQYSPFFS LADQSEISPD VPQNCklTFV 

M. thermophila ESRPCDTpDl GFQCgTAISH FWGQYSPYFS VpSElDaS.. IPDDCeVTFA 

Consensus NSHSCDTVDG GYQCFPEISH LWGQYSPYFS LEDESAISPD VPDDC-VTFV 

Consensus phytas© NSHSCDTVDG GYQCFPEISH LWGQYSPYFS LEDESAISPD VPDDCKVTFV 



A. terreus 9A-1 

A. terreus cbs 

A. niger var. awamori 

A. niger T213 

A. niger NRRL3135 

A. fumigatus 13073 

A. fumigatus 32722 

A. fumigatus 58128 

A, fumigatus 26906 

A. fumigatus 32239 

A. nidulans 

?. thermophilus 

M. thermophila 

Consensus 
Consensus phytaaa 
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QVLARHGARs 
QVLARHGARs 
QVLSRHGARY 
QVLSRHGARY 
QVLSRHGARY 
QVLSRHGARY 
QVLSRHGARY 
QVLSRHGARY 
QVLSRHGARY 
QVLSRHGARY 
QVLSRHGARY 
QLLSRHGARY 
QVLSRHGARa 



PThSKtKAYA 
PTDSKtKAYA 
PTESKgKkYS 
PTESKgKkYS 
PTDSKgKkYS 
PTSSKsKkYK 
PTSSKsKkYK 
PTSSKsKkYK 
PTSSKsKkYK 
PTASKsKkYK 
PTESKsKAYS 
PTSSKtElYS 
PTIKRaaSYv 



AtlAAIQKSA 
AtlAAIQKNA 
ALIEEIQQNV 
ALIEEIQQNV 
ALIEEIQQNA 
kLVTAIQaNA 
kLVTAIQaNA 
kLVTAIQaNA 
kLVTAIQaNA 
kLVTAIQKNA 
GLIEAIQKNA 
QLISrIQKTA 
DLIDrlHhGA 



TaFpGKYAFL 
TaLpGKYAFL 
TtFDGKYAFL 
TtFDGKYAFL 
TtFDGKYAFL 
TdFKGKFAFL 
TdFKGKFAFL 
TdFKGKFAFL 
TdFKGKFAFL 
TeFKGKFAFL 
TsFwGQYAFL 
TaYKGyYAFL 
IsYgPgYEFL 
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QSYNYSLDSE 
KSYNYSMGSE 
KTYNYSLGAD 
KTYNYSLGAD 
KTYNYSLGAD 
KTYNYTLGAD 
KTYNYTLGAD 
KTYNYTLGAD 
KTYNYTLGAD 
ETYNYTLGAD 
ESYNYTLGAD 
KDYrYqLGAN 
RTYDYTLGAD 



QVLSRHGARY PTSSK-KAYS ALIEAIQKNA T-FKGKYAFL KTYNYTLGAD 
QVLSRHGARY PTSSKSKAYS ALIEAIQKNA TAFKGKYAFL KTYNYTLGAD 



A. terreus 9A-1 

A. terreus cbs 

A. niger var. awamori 

A. niger T213 

A. niger NRRL3135 

A. fumigatus 13073 

A. fumigatus 32722 

A. fumigatus 58128 

A. fumigatus 26906 

A. fumigatus 32239 

A. nidulans 

T. thermophilus 

M. thermophila 

Consensus 
Consensus phytase 
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ELTPFGrNQL 
NLTPFGrNQL 
DLTPFGEQEL 
DLTPFGEQEL 
DLTPFGEQEL 
DLTPFGEQQL 
DLTPFGEQQL 
DLTPFGEQQL 
DLTAFGEQQL 
DLTPFGEQQM 
DLTiFGENQM 
DLTPFGENQM 
ELTRtGQQQM 



rDlGaQFYeR 
qDlGaQFYRR 
VNSGIKFYQR 
VNSGIKFYQR 
VNSGIKFYQR 
VNSGIKFYQR 
VNSGIKFYQR 
VNSGIKFYQR 
VNSGIKFYQR 
VNSGIKFYQK 
VDSGaKFYRR 
IQIGIKFYnH 
VNSGIKFYRR 



YNALTRhlnP 
YDTLTRhlnP 
YESLTRNIIP 
YESLTRNI I P 
YESLTRNIVP 
YKALARSWP 
YKALARSWP 
YKALARSWP 
YKALARSWP 
YKALAgSWP 
YKNLARKnTP 
YKSLARNaVP 
YRALARKsIP 



FVRATDASRV 
FVRAADSSRV 
FIRSSGSSRV 
FIRSSGSSRV 
FIRSSGSSRV 
FIRASGSDRV 
FIRASGSDRV 
FIRASGSDRV 
FIRASGSDRV 
FIRSSGSDRV 
FIRASGSDRV 
FVRCSGSDRV 
FVRTAGqDRV 



150 

hESAEKFVEG 
hESAEKFVEG 
IASGEKFIEG 
IASGEKFIEG 
IASGKKFIEG 
IASGEKFIEG 
IASGEKFIEG 
IASGEKFIEG 
IASGEKFIEG 
IASGEKFIEG 
VASAEKFING 
IASGrlFIEG 
VhSAENFTQG 



DLTPFGENQM VNSGIKFYRR YKALARK-VP FVRASGSDRV IASAEKFIEG 
DLTPFGENQM VNSGIKFYRR YKALARKXVP FIRASGSDRV IASAEKFIEG 
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Figure 1/2 



Cerreus 9A-1 
terreus cbs 
niger var. awamori 
niger T213 
niger NRRL3135 
A. fumigacus 13073 
A. fumigacus 32722 
fumigacus 58123 
fumigacus 26906 
fumigacus 32239 
nidulans 
chermophilus 
checmophila 



A. 
A. 
A. 
A. 
A. 



A. 
A. 
A. 
A. 

r. 
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FQTARqDDHh 
FQNARqGDPh 
FQSTKLkDPr 
FQSTKLkDPr 
FQSTKLkDPr 
FQqAKLADPG 
FQqAKLADPG 
FQqAKLADPG 
FQqAKLADPG 
FQqANVADPG 
FRKAQLhCHG 
FQSAKVlDPh 
FHSA1LADRG 



ANpHQPSPrV 
ANpHQPSPrV 
AqpgQSSPkl 
AqpgQSSPkl 
AqpgQSSPkl 

a.tnraapa: 
a.tnraapa: 
a. tnraapai 
a.tnraapai 

A.TNRAAPVI 
S. . 7QAT ?VV 
SDkHDAPPTl 
STvRPTlPyd 



DValPEGSAY 
DVVIPEGTAY 
DVVISEASSs 
DVVISEASSs 
DVVISEASSs 
SVIIPESETF 
SVIIPESETF 
SVIIPESETF 
SVIIPESETF 
SVIIPESETY 
NVIIPZiDGF 
NVIIeEGPSY 
nVVIPETAGa 



NNTLEHS1CT 
NNTLEHSICT 
NNTLDPGTCT 
NNTLDPGTCT 
NNTLDPGTCT 
NNTLDHGVCT 
NNTLDHGVCT 
NNTLDHGVCT 
NNTLDHGVCT 
NNTLDHSVCT 
NNTLDH3TCV 
NNTLDtGSC? 
NNTLHNDiCT 
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AFES . . .STV 
AFEA. • .STV 
VFED. . . SEL 
VFED. . .SEL 
VFED. . .SEL 
kFEA. . .SQL 
kFEA . . . SQL 
kFEA. . . SQL 
kFEA . . . SQL 
NFEA. . .SEL 
SFEN. . .DEr 
VFED. . .SSg 
AFEEgpySTI 



Consensus 
Consensus phytaae 



FQSAKLADPG S-PHQASPVI NVIIPEGSGY NN7LDHGTCT AFED SEL 
FQSAKI4ADPG SQPHQASPVI DVIIPE5SGY NNTLDHGTCT AFED... SEL 



A. Cerreus 9A-1 

A. cerreus cbs 

A. niger var. awamori 

A. niger T.213 

A. niger NRRL3135 

A. fumigacus 13073 

A. fumigatus 32722 

A. fumigacus 58128 

A. fumigacus 26906 

A. fumigacus 32239 

A. nidulans 

T. chermophilus 

M. thermophila 

Consensus 
Consensus phytase 
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GDDAvANFTA 
GDAAADNFTA 
ADTVEANFTA 
ADTVEAN FT A 
ADTVEANFTA 
GDEVAANFTA 
GDEVAANFTA 
GDEVAANFTA 
GDEVAANFTA 
GDEVEANFTA 
ADEiEANFTA 
GHDAQEKFAk 
GDDAQDTY1S 



VFAPAIaQRL 
VFAPAIakRL 
TFAPSIRQRL 
TFAPSIRQRL 
TFVPSIRQRL 
IFAPDIRARa 
lFAPDtRARa 
IFAPDIRARa 
lFAPDIRARa 
1FAPAIRARI 
IMGPPIRkRL 
qFAPAHEKI 
TFAGPItARV 



EADLPGVqLS 

EADLPGVqLS 

ENDLSGVTLT 

ENDLSGVTLT 

ENDLSGVTLT 

EkHLPGVTLT 

EkHLPGVTLT 

EkHLPGVTLT 

KkHLPGVTLT 

EkHLPGVqLT 

ENDLPGIKLT 

KDHLPGVDLA 

NANLPGANLT 



TDDVVnLMAM 
ADDVVnLMAM 
DTEVTyLMDM 
DTEVTyLMDM 
DTEVTyLMDM 
DEDVVsLMDM 
DEDVVsLMDM 
DEDVVsLMDM 
DEDVVsLMDM 
DDDWsLMDM 
NENVIyLMDM 
vSDVpyLMDL 
DADTVaLMDL 
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CPFETVS1TD 
CPFETVS1TD 
CSFDTIStST 
CSFDTIStST 
CSFDTIStST 
CSFDTVARTS 
CS FDTVARTS 
CSFDTVARTS 
CSFDTVARTS 
CSFDTVARTA 
CSFDTMARTA 
CPFETLARNh 
CPFETVAsSS 



GDDAEAN FTA TFAPAIRARL EADLPGVTLT DEDVV-LMDM CFFETVARTS 
GDDVEANFTA LFAPAIRARL EADLPGVTLT DEDWYLMDM CPFETVARTS 



25: 300 

A Cerreus 9A-1 - DAhTLSPFC DLFTAtEWtq YNYL1SLDKY YGYGGGNPLG 

A* terreus cbs DAhTLSPFC DLFTAaEWtq YNYL1SLDKY YGYGGGNPLG 

a! niger var. awamori vDTKLSPFC DLFTHdEWih YDYLQSLkKY YGHGAGNPLG 

A -ice' T213 vDTKLSPFC DLFTHdEWih YDYLRSLkKY YGHGAGNPLG 

a' -icer NRRL3135 vDTKLSPFC DLFTHdEWin YDYLQSLkKY YGHGAGNPLG 

A* fumigacus 13073 DASQLSPFC QLFTHnEWkk YNYLQSLGKY YGYGAGNPLG 

A* fumigacus 32722 DASQLSPFC QLFTHnEWkk YNYLQSLGKY YGYGAGNPLG 

A* fumica-us 58128 DASQLSPFC QLFTHnEWkk YNYLQSLGKY YGYGAGNPLG 

A fumica-us 26906 DASQLSPFC QLFTHnEWkk YNYLQSLGKY YGYGAGNPLG 

A fum<aatus 32239 DASELSPFC AI FTHnEWkk YDYLQSLGKY YGYGAGNPLG 

A. nidulans HGTELSPFC AI FTEkEWlq ^DYLQSLSKY ^YGAGSPLG 

t chermophilus TDT.LSPFC ALsTQeEWqa YDYYQSLGKY YGnGGuNPLG 

H. Chermophila sdpazadagg gNGrpLSFFC rLFSEsEWra YDYLQSVGKW YGYGFGNPLG 

Consensus -DATELSPFC ALFTE-EW-- YDYLQSLGKY YGYGAGNPLG 

Consensus phytase DATELSPFC ALFTHDEWRQ YDYLQSLGKY YGYGAGNPLG 
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Figure 1/3 

301 350 

A. terreus 9A-1 PVQGVGWaNE LMARLTRAPV HDHTCVNNTL DA5PATFPLN ATLYADFSHD 

A. cerreus cbs PVQGVGWaNE LIARLTRSPV HDHTCVNNTL DANPATFPLN ATLYADFSHD 

A. niger var. awamori PTQGVGYaNE LIARLTHSPV HDDTSSNHTL DSNPATFPLN STLYADFSHD 

A. niger T213 PTQGVGYaNE LIARLTHSPV HDDTSSNHTL DSNPATFPLN STLYADFSHD 

A. niger NRRL3135 PTQGVGYaNE LIARLTHSPV HDDTSSNHTL DSSPATFPLN STLYADFSHD 

A, fumigatus 13073 PAQGIGFtNE LIARLTRSPV QDHTSTNsTL vSNPATFPLN ATMYVDFSHD 

A. fumigaCUS 32722 PAQGIGFtNE LIARLTRSPV QDHTSTNsTL vSNPATFPLN ATMYVDFSHD 

A. fumigatus 58128 PAQGIGFtNE LIARLTRSPV QDHTSTNsTL vSNPATFPLN ATMYVDFSHD 

A. fumigatus 26906 PAQGIGFtNE LIARLTRSPV QDHTSTNsTL vSNPATFPLN ATMYVDFSHD 

A. fumigatus 32239 PAQGIGFtNE LIARLTHSPV QDHTSTNsTL DSDPATFPLN ATI YVDFSHC 

A. nidulans PAQGIGFtNE LIARLTQSPV QDNTSTNKTL DSNPATFPLD r KLYADF3HD 

T, thermophilus PAQGVGFvNE LIARMTHSPV QDYTTVNHTL DSNPATFPLN ATLYADFSHD 

M. thermophila PTQGVGF vNE LLARLAqvPV RDgTSTNRTL DGDPrTFPLG rPLYADFSHD 

Consensus PAQGVGF-NE LIARLTHSPV QDKTSTNHTL DSNPATFPLN ATLYADFSHD 

Consonsus phytase PAQCVGFANE LIARLTRSPV QDHTSTNHTL DSNPATFPLN ATLYADFSHD 



351 400 

A. terreus 9A-1 SNLVSIFWAL GLYNGTAPLS qTSVESVSQT DGYAAAWTV? FAARAYVEKM 

A. terreus cbs SNLVSIFWAL GLYNGTkPLS qTTVEDITrT DGYAAAWTVP F AARAY I EMM 
A. niger var. awamori NGIISILFAL GLYNGTkPLS TTTVENITQT DGF5SAWTVP FASRl YVEMM 

A. niger T213 NGIISILFAL GLYNGTkPLS TTTVENITQT DGFSSAWTVP FASRl YVEMM 

A. niger NRRL3135 NGIISILFAL GLYNGTkPLS TTTVENITQT DGFSSAWTVP FASR1YVEMM , 

A. fumigatus 13073 NSMVSI FFAL GLYNGTEPLS rTSVESaKEl DGYSASWVVP FGARAYFEtM 

A. fumigatus 32722 NSMVSI FFAL GLYNGTGPLS rTSVESaKEl DGYSASWVVP FGARAYFEtM 

A. fumigatus 58128 N5MVS I FFAL GLYNGTEPLS rTSVESaKEl DGYSASWVVP FGARAYFEtM 

A. fumigatus 26906 NSMVSI FFAL GLYNGTEPLS rTSVESaKEl DGYSASWVVP FGAFAYFEtM 

A. fumigatus 32239 NGMIPIFFAM GLYNGTEPLS qTSeESTKES NGYSASWAVP FGARAYFEtM 

A. nidulans NSMISIFFAM GLYNGTQPLS mDSVESIQEm DGYAASWTVP FGARAYFELM 

T. thermophilus NTMTSIFaAL GLYNGTAkLS TTEIKSIEET DGYSAAWTVP FGGRAYIEMM 

M- Chermophila NDMMGVLgAL GaYDGVPPLD KTArrDpEEl GGYAASWAVP FAARiYVEKM 

Consensus NSMISI FFAL GLYNGTAPLS TTSVESIEET DGYAASWTVP FGARA YVEMM 

Consensus phytase NSMISI FFAL GLYNGTAPLS TTSVESIEET DGYSASWTVP FGARAYVEMM 



A . cerreus 9A-1 

A. terreus cbs 

A. niger var. awamori 

A. niger T213 

A. niger NRRL3135 

A . fumigatus 13073 

A. fumigatus 32 7 22 

A. fumigatus 53128 

A. fumigatus 26906 

A. fumigatus 32239 

A . nidulans 

T. thermophilus 

M. thermophila 

Consensus 
Consensus phytase 
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QC RAEKE PLVRVLVNDR VMPLHGCPTD KLGRCKrDAF 

QC RAEKQ PLVRVLVNDR VMPLHGCAVD NLGRCKrDDF 

QC QAEQE PLVRVLVNDR WPLHGCPID aLGRCTr DSF 

QC QAEQE PLVRVLVNDR WPLHGCPID aLGRCTr DSF 

QC QAEQE PLVRVLVNDR VVPLHGCPVD aLGRCTr DSF 

QC KSEKE PLVRALINDR VVPLHGCDVD KLGRCKLNDF 

QC KSEKE PLVRALINDR VVPLHGCDVD KLGRCKLNDF 

QC KSEKE SLVRALINDR VVPLHGCDVD KLGRCKLNDF 

QC KSEKE PLVRALINDR VVPLHGCDVD KLGRCKLNDF 

QC KSEKE PLVRALINDR VVPLHGCAVD KLGRCKLKDF 

QC E.KKE PLVRVLVNDR VVPLHGCAVD KFGRCTLDpW 

QC...! DDSDE PVVRVLVNDR WPLHGCEVD SLGRCKrDDF 

RCsggggggg ggegrQEKDE eMVRVLVNDR VMTLkGCGAD ErGMCTLErF 

QC- QAEKE PLVRVLVNDR VVPLHGCAVD KLGRCKLDDF 

QC QAEKE PLVRVLVNDR VVPLHGCAVD KLGRCKRDDF 
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A. terreus SA-1 

A. terreus cbs 

A, niger var. awamori 

A. niger T213 

A. niger NRRL3135 

A. fuinigatus 130*73 

A. fumigatus 32722 

A. funrigacus 58128 

A. fumigatus 26906 

A. fuinigacus 32239 

A. nidulans 

T. thermophilics 

M. thermophila 

Consensus 
Consensus phytase 



Figure 1/4 

451 47 
VAGLSFAQAG GNWADCF- — 

VEGLSFARAG GNWAECF 

VrGLSFARSG GDWAECsA— 
VrGLSFARSG GDWAECFA— 
VrGLSFARSG GDWAECFA — 
VKGLSWARSG GNWGECFS-- 
VKGLSWARSG GNWGECFS — 
VKGLSWARSG GNWGECFS — 
VKGLSWARSG GNWGECFS — 
VKGLSWARSG GNSEQSFS — 
VEGLNFARSG GNWkTCFTl- 
VrGLSFARqG GNWEGCYAas 
IESMAFARGN GKWD1CFA-- 

VEGLSFARSG GNWAECFA-- 
VEGLSFARSG GNWAECFA. . 
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Figure 2/1 

C?-l 

Eco RIMGVFVVLLSIATLFGST 
TATATGAATTCATGGGCGTGTTCGTCCTGCTACTGTCCATTGCCACCTTGTTCGGTTCCA 
I + + + + 4 + go 

ATATACTTAAGTACCCGCACAAGCAGCACGATGACAGGTAACGGTGGAACAAGCCAAGGT 

SGTALGPRGNSHSCDTVDGG 

CATCCGGTACCGCCTTGGGTCCTCGTGGTAATTCTCACTCTTGTGACACTGTTGACGGTG 
61 + + + + 4 + 120 

GTAGGCCATGGCGGAACCCAGGAGCACCATTAAGAGTGAGAACACTGTGACAACTGCCAC 

CP-2 

CP-3 ; 
YQCFPEISHLWGQYSPYFSL 
GTTACCAATGTTTCCCAGAAATTTCTCACTTGTGGGGTCAATACTCTCCATACTTCTCTT 

121 — + + - + + + 183' 

CAATGGTTACAAAGGGTCTTTAAAGAGTGAACACCCCAGTTATGAGAGGTATGAAGAGAA 

EDESAISPDVPDDCRVTFVQ 

TGGAAGACGAATCTGC TATTTCTCCAGAC GTTC C AG ACG AC TG TAGAG T T ACTT TC GT TC 
191 + * f + + 240 

ACCTTCTGCTTAGACGATAAAGAGGTCTGCAAGGTCTGCTGACATCTCAATGAAAGCAAG 

CP-4 

CP-5 

VLSRHGARYPTSSKSKAYSA 
AAGTTTTGTCTAGACACGGTGCTAGATACCCAACTTCTTCTAAGTCTAAGGCTTACTCTG 
241 + * + * + * 300 

TTCAAAACAGATCTGTGCCACGATCTATGGGTTGAAGAAGATTCAGATTCCGAATGAGAC 

LI EAIQKNATAFKGKYAFLX 
CTTTGATTGAAGCTATTCAAAAGAACGCTACTGCTTTCAAGGGTAAGTACGCTTTCTTGA 

301 + + + + + + 360 

GAAACTAACTTCGATAAGTTTTCTTGCGATGACGAAAGTTCCCATTCATGCGAAAGAACT 

CP-6 

CP-7 

TYNYTLGADDLTPFGENQMV 

AGACTTACAACTACACTTTGGGTGCTGACGACTTGACTCCATTCGGTGAAAACCAAATGG 
361 + + + + + ♦ 420 

TCTGAATGTTGATGTGAAACCCACGACTGCTGAACTGAGGTAAGCCACTTTTGGTTTACC 

NSGIKFYRRYKALARKIVPF 
TTAACTCTGCTATTAAGTTCTACAGAAGATACAAGGCTTTGGCTAGAAAGATTGTTCCAT 

421 ♦ + + + + + 480 

AATTGAGACCATAATTCAAGATGTCTTCTATGTTCCGAAACCGATCTTTCTAACAAGGTA 

CP-8 

CP-9 

IRASGSDRVIASAEKFIEGF 
TCATTAGAGCTTCTGGTTCTGACAGAGTTATTGCTTCTGCTGAAAAGTTCATTGAAGGTT 

481 + + + + + + 540 

AGTAATCTCGAAGACCAAGACTGTCTCAATAACGAAGACGACTTTTCAAGTAACTTCCAA 

QSAKLAOPGSQPHQAS PVID 
TCCAATCTGCTAAGTTGGCTGACCCAGGTTCTCAACCACACCAAGCTTCTCCAGTTATTG 

541 + + + + + ♦ 600 

AGG7TAGACGATTCAACCGACTGGGTCCAAGAGTTGGTCTGGTTCCAAGAGCTCAATAAC 

CP-10 

cp-i: 

v i 1 pegsgynnt1dhgtcta 
acgttattattccagaaggatccggttacaacaacactttggaccacggtacttgtactg 

601 + + + + + * 66C 

TGCAATAATAAGGTCTTCCtAGgCCAATGTTGTTGTGAAACCTGGTGCCATGAACATGAC 
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Figure 2/2 

FEDSELGDDVEANFTALFAP 
CTTTCGAAGACTCTGAATTGGGTGACGACGTTGAAGCTAACTTCACTGCTTTGTTCGCTC 

661 + + + + + + 720 

GAAAGCTTCTGAGACTTAACCCACTGCTGCAACTTCGATTGAAGTGACGAAACAAGCGAG 

CP-12 

AIRARLEADLPGVTLTDEDV 
CAGCTATTAGAGCTAGATTGGAAGCTGACTTGCCAGGTGTTACTTTGACTGACGAAGACG 

721 + + * + + + 780 

GTCGATAATC TCGATCTAACCTTCGACTGAACGGTCCACAATGAAACTGACTGCTTC TGC 

CP-13 

VY LKD'MC P FET V A R T S OAT E 
TTGTTTACTTGATGGACATGTCTCCATTCGAAACTGTTGCTAGAACTTCTGACGCTACTG 

78X + f + r + + 840 

AACAAATeAACTACCTGTACACAGGTAAGCTTTGACAACGATCTTGAAGACTGCGATGAC- 

LSPFCALFTHDEWRQYDYLQ 
AATTGTCTCCATTCTGTGCTTTGTTCACTCACGACGAATGGAGACAATACGACTACTTGC 

841 + + + - ; + + 900 

T T AACAGAGGTAAGACACGAAACAAGTGAGT GC TGCT T ACCTC TGTTATGC TGATGAAC G 
CP-14 

CP-15 

SLGKYYGYGAGNPLGPAQGV 
AATCTTTGGGTAAGTACTACGGTTACGGTGCTGGTAACCCATTGGGTCCAGCTCAAGGTG 

901 + + ♦ + + 960 

TTAGAAACCCATTCATGATGCCAATGCCACGACCATTGGGTAACCCAGGTCGAGTTCCAC 

G FAN E LI ARLT RS PVQD H T S 
TTGCTTTCGCTAACGAATTGATTGCTAGArTGACTAGATCTCCAGTTCAAGACCACACTT 

961 + + * ♦ + * 1020 

AACCAAAGCGATTGCTTAACTAACGATCTAACTGATCTAGAGGTCAAGTTCTGGTGTGAA 

CP-16 

CP-17 

TNHTLDSNPATFPLNATLYA 
CTACTAACGACACTTTGGACTCTAACCCAGCTACTTTCCCATTGAACGCTACTTTGTACG 

1021 +- — + * + 1080 

GATGATTGGTGTGAAACCTGAGATTGGGTCGATGAAAGGCTAACTTGCGATGAAACATGC 

OFSHDMSKISIFFALGLYNG 
CTGACTTCTCTCACGACAACTCTATGATTTCTATTTTCTTCGCTTTGGGTTTGTACAACG 

X081 ♦ + * + — ~ + 1140 

GACTGAAGAGAGTGCTGTTGAGATACTAAAGATAAAAGAAGCGAAACCCAAACATGTTGC 

CP-13 

CP-19 

TAPLSTT5VESIEETDGYSA 
GTACTGCTCCATTGTCTACTACTTCTGTTGAATCTATTGAAGAAACTGACGGTTACTCTG 

1141 - — * * ♦ — — ♦ 1200 

CATGAC GAGGT AACAGATGATGAAGACAAC T T AGAT AACT TC I T TGAC TGCC AA T G AG AC 

S WT VP FGARAYVEMMQCQAE 
CTTCTTGGACTGTTCCATTCGGTGCTAGAGCTTACGTTGAAATGATGCAATGTCAAGCTG 

1201 * ** + + T 1260 

GAAGAACCTGACAAGG7AAGCCACGATCTCGAATGCAACTTTACTACGTTACAGTTCGAC 

CP-20 

CP-21 

KEPLVRVLVNDRVVPLHGCA 
AAAAGGAACCATTGG7TAGAGTTTTGGTTAACGACAGAGTTGTTCCATTGCACGGTTGTG 

126 i + + + + "--+ 1320 

TTTTCCTTGGTAACCAATCTCAAAACCAATTGCTGTCTCAACAAGGTAACGTGCCAACAC 
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Figure 2/3 

VDKLGRCKRDDFVEGLSFAR 
CTGTTGACAAGTTGGGTAGATGTAAGAGAGACGACTTCGTTGAAGGTTTGTCTTTCGCTA 

1321 + + + + + + 1380 

GACAACTGTTCAACCCATCTACATTCTCTCTGCTGAAGCAACTTCCAAACAGAAAGCGAT 

CP-22 

SGGNWAECFA* Eco RI 
GATCTGGTGGTAACTGGGCTGAATGTTTCGCTTAAGAATTCATATA 

1381 + + + + 1426 

CTAGACCACCATTGACCCGACTTACAAAGCGAATTCTTAAGTATAT 



25 



EP 0 897 985 A2 



Figure 3 
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Figure 4 
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Figure 5 
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Figure 6 



o 

CO 




□ 

0 



Consensus Phytase 
Mutant Q50L 
Mutant Q50T 
Mutant Q50G 



i 



ill 



U 



III 



0) 

"co 



0 0) O d) Q) 0) 0) Q) 



CL CL CL CL 

"q_ CO CO CO CO 

o o o o 



a a a o. 

w w w (/) 

o o o o 



0) 
03 
0) 

o 



Q> CL 

"co ^ 

> < 



a. a. 
< < 



O) -2- 



a a a a a a a a o o 



c 

0) 

o 



f 2 <P 

9 w 



a. V ~ 



CD 
CO 

o 
o 



o 



I I I 

CO LO CO 

0) a) o 

co co if 

o o a> 

o x> o 

3 T- >, 



I 

CM 

I 



a 



i i 



a. <D 
co o 
o sz 

SZ CL 

CL CO 

CO ^ 
CL 



29 



EP 0 897 985 A2 



Figure 7 
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